Evidence for non-Gaussianity in the DMR Four Year Sky Maps 

Pedro G. Ferreira ^, Joao Magueijo^, and Krzysztof M. Gorski^'^ 
^Center for Particle Astrophysics, University of California, Berkeley, CA94720, USA 
^Theoretical Physics, Imperial College, Prince Consort Road, London SWT 2BZ, UK 
^Theoretical Astrophysics Center, Juliane Maries Vej 30, DK-2100, Copenhagen 0, 

Denmark 

^ Warsaw University Observatory, Warsaw, Poland 



Received 



accepted 



- 2 - 



ABSTRACT 

We introduce and study the distribution of an estimator for the normahzed 
bispectrum of the Cosmic Microwave Background (CMB) anisotropy. We 
use it to construct a goodness of fit statistic to test the coadded 53 and 90 
GHz COBE-DMR 4 year maps for non-Gaussianity. Our results indicate that 
Gaussianity is ruled out at the confidence level in excess of 98%. This value is a 
lower bound, given all the investigated systematics. The dominant non-Gaussian 
contribution is found near the multipole of order £ — 16. Our attempts to 
explain this effect as caused by the diffuse foreground emission from the Galaxy 
have failed. We conclude that unless there exists a microwave foreground 
emission which spatially correlates neither with the DIRBE nor Haslam maps, 
the cosmological CMB anisotropy is genuinely non-Gaussian. 

Subject headings: Cosmology: theory - observation - the cosmic microwave 
background: tests of Gaussianity 
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1. Introduction 

We shall consider fluctuations in the CMB as a random field on the sphere, -^(n). 
One can expand such a field in terms of Spherical Harmonic functions: 

AT 



v^) = ^ aimYimiTti) (1) 
For a statistically isotropic field one has 

(a^imiOljma) = C'<?i<^<?i<?2^mim2 (2) 

We can also define the two-point function in terms of ^(n). Isotropy implies that the 
correlation matrix can only depend on the angle between the two points considered. This is 
encoded in the 2-point correlation C'^'^\9). From (|l]) and (|]) we find 

IP -I- 1 

^^'^(^)=E^^C',P,(cos^) (3) 

Hence the Ci may be regarded as a Legendre transform of the 2-point correlation function. 

It is a standard lore that, barring some mathematical obstructions, one can reconstruct 
the probability distribution function of any random field from its moments. Isotropy 
imposes "selection rules" on these moments. For instance, the 3-point moment is given by 

I ii h h \ 
\ mi m2 ms / 

where the (...) is the Wigner 3 J symbol. The coefficients Ci^i^i^ are usually called the 
bispectrum. If we assume that there are no correlations between different i multipoles 
then the only non-zero component of the bispectrum is Cm = B^. The collapsed 3-point 
correlation function C^'^\9) (the average of a temperature squared at one point, and a 
temperature at another point, separated by an angle 6) is now 

'2^ + 1^/2 / ^ ^ 



(5) 
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in analogy with (^. Hence the Bi is related to the Legendre transform of C'-^-'. The angular 
power spectrum is often considered a more powerful tool than the correlation function 
C^'^\9) for discriminating between theories, and one might argue the same way with regard 
to the reduced bispectrum and the 3-point function C^{0). 

The importance of higher order statistics for characterizing large scale structure has 
been stressed before ( Peebles 1980|) . The non-linear evolution of primordial Gaussian 



fluctuations has been analysed in detail ( Peebles 1980| , Pouchet et al. 1992|) and the 



skewness arising in such models has been shown to be consistent with current observations 
( pouchet et al. 199^j| , |Gaztahaga 1994| ). [Luo 199^ discussed the statistical properties and 



detectability of the bispectrum for a variety of non-Gaussian signals. Kogut et al. (19^F) 
measured the pseudocollapsed and equilateral three point function of the DMR four year 
data and found them to be consistent with Gaussianity. The analysis performed here should 
be considered complementary to that of [Kogut et al. (1996)| : non-Gaussian signals which 
may be obscured in real space can become evident in I space. 

In this letter we shall use a general formalism for generating estimators of higher order 
moments on a sphere ( Perreira, Gorski fc Magueijo 1998|) . In this formalism one considers 
all possible tensor products of AT^ (each multipole component of the field) and from these 
one extracts the singlet (invariant) term. In the case of bispectrum one has 




Note that only even values of ^ lead to nonzero values of the Bi due to the symmetries of the 
Wigner 3-J coefficients. In practice it is essential to factor out the power spectrum from our 
statistic. We also wish to define statistics which are invariant under parity transformations, 
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and not just rotations. Therefore we define If to be 



(C,)3/2 

where Ci = J2m k^mP- Our statistics are dimensionless and are normahzed so that a 
cyhndrically symmetric multipole has If = 1. 



(7) 



The i = 2 case was discussed and given a physical interpretation in fViagueijo (1995 



The quadrupole has 5 degrees of freedom. Of these only 2 are rotationally invariant. One is 
the quadrupole intensity C2, and tells us how much power there is in the quadrupole. The 
other is essentially /| and tells us how this power is distributed among the different a2m 
but only as far as there is a rotationally invariant meaning to the concept. For instance if 
/| = 1 then there is a frame in which all the power is concentrated in the m = mode. 
Such a quadrupole is cyhndrically symmetric, but of course the symmetry axis orientation 
is uniformly distributed, to comply with statistical isotropy. If J| = then on the contrary 
cylindrical symmetry is maximally broken. The probability distribution function of /| is 
uniform in Gaussian theories ([Magueijo (1995)| ). 



2. Goodness of fit and evidence for non-Gaussianity 

We will be testing the inverse noise variance weighted, average maps of the 53A, 53B, 
90A and 90B COBE-DMK channels, with monopole and dipole removed, at resolution 6, in 
ecliptic pixelization. We use the extended galactic cut of [Banday et al 199^ , and Pennet 



|et al 199^ to remove most of the emission from the plane of the Galaxy. We apply our 
statistics to the DMR maps before and after correction for the plausible diffuse foreground 
emission outside the galactic plane as described in |Kogut et al. (1996)| , and |G6rski et al. 



1996| . To estimate the Ifs we set the value of the pixels within the galactic cut to and the 



average temperature of the cut map to zero. We then integrate the map multiplied with 
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spherical harmonics to obtain the estimates of the a^^s and apply equations ^ and ^ 

We have used Monte Carlo simulations to find the distribution of the estimators /| 
as applied to Gaussian maps subject to DMR noise and galactic cut (see Fig. |l]). These 
distributions are very non-Gaussian. In principle this would complete the theoretical work 
required for converting the observed If (which we also plot in Fig. |l|) into a statistical 
statement on Gaussianity, but we proceed further by defining a new "goodness of fit" 
statistic as follows. 

We wish to construct a tool similar to the (often used for comparing predicted and 
observed spectra) but adapted to the non-Gaussian distributions P{If)- First, however, 
recall that if the {If} were a set of independent, A^(//^, cr^)-distributed variables, the 
usual definition of the chi squared would read 

with distributed as x^, a good fit represented by X"^ ^ 1, and X'^ <^ 1 (X^ ^ 1) 
corresponding to the unusually large (small) scatter in the data given the assumed variances. 
The distribution of X"^ is used to find the probability, given the model, of a value of X"^ as 
large or as small as the one observed. The converse probability is the confidence level for 
rejecting the model. 

Since the If distributions are non Gaussian we generalize the for a set of probability 
functions Pi{lf) associated with observations {If} by defining the following functional 

= ^E^' = ^E(-21og/'.(/|) + A), (9) 

where the constants (3^ are defined so that for each term of the sum {Xf) = 1. The definition 
reduces to the usual X"^ for Gaussian Pi. 

As an illustration let us first approximate the distributions of the If by P{If ) = 2(1 — J|) 
— a good approximation for £ around 10. Then X"^ = —2 log(l — If). Like the standard 
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one has < X'^ <^ 1 for observations close to the peak of the distribution, here at If = 0. 
Indeed X'^{0) = 0. However the peak of P{If) is far from its average, and so the standard 
would produce = at the wrong observation. For observations far from the peak of 
the distribution (but subject to the constraint If < 1) X"^ goes to infinity. In contrast the 
standard X^ would always remain finite. 

The proposed therefore does for these non-Gaussian distributions what the usual 
does for normal distributions. To illustrate its efficiency let us find its distribution. 

First note that P(X^) = exp(— X^). Consider now a X^ built from averaging the X^ of N 

independent observations: 

^^ = ^EMi-/|) (10) 

Using characteristics its distribution may be found to be 

F{Xl) = JJ^e--^'^Xl^-' (11) 

This is a x^j^- Even if the original invariants were Gaussian distributed the standard X^ 
would only be a x^. Of course for values of I away from 10 the P{lf) are very different from 
the analytical fit presented. In particular for £ = 2 the distribution is uniform, meaning 
that any observation is compatible with Gaussianity. Applying the prescription @ we have 
that indeed X2(/|) = 1. 

In practice we build a X^ for the COBE-DMR data by means of Monte Carlo 
simulations. We proceed as follows. First we compute the distributions P{lf), for 
i = 2, ... ,18, for a Gaussian process as measured subject to our galactic cut, and pixel 
noises. These P{lf) were inferred from 25000 realizations (see Fig. p. From these 
distributions we then build the X^ defined in (P), taking special care with the numerical 
evaluation of the constants (3^. We call this function X^qq^. We then find its distribution 
F(X^Q^^) from 10000 random realizations. This is very well approximated by a 
distribution with 12 degrees of freedom. If all P{If) were as in the analytical fit above. 
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we could conclude that we successfully measured an effective number of useful invariants 
equal to 6. This is less than the number of invariants we actually measured (10) and this 
is simply due to anisotropic noise and galactic cut. However, had we used a standard 
statistics the effective number of useful invariants would be only 3. 

We then compute ^^obb with the actual observations and find X^q^^ = 1.81. 
One can compute P{Xqq^^ < 1.81) = 0.98. Hence, it would appear that we can reject 
Gaussianity at the 98% confidence level. 

3. Discussion 

The result that we have obtained raises a number of questions which we shall attempt 
to answer. From Fig. |l] it is clear that Jfg is far in the tail of the Gaussian ensemble and 
it dominates the statistic. One would like to understand the importance of both cosmic 
variance and noise to this measurement. We would also like to assess the extent to which a 
galactic foreground contaminant could be responsible for this result. 

In order to answer the first question we look for Bayesian estimates for the If as they 
are for our sky. To do this we first estimate what the temperature fluctuations Tj = -^(rij) 
in each pixel i in our dataset are likely to be, given DMR observations O,, and noises af. 
We construct the posterior P{Ti\Oi) assuming uniform priors in Tj, and also that a priori no 
correlations exist between the Tj. The latter assumption is often used in image restoration 
algorithms, such as maximum entropy methods. We then produce an ensemble of skies 
with the distribution P{Ti\Oi). From it we infer P{lf\Oi), the distributions for what the If 
for our sky are likely to be given DMR observations and noise. This procedure will allow 
us to assess the importance of noise in each of our measurements. However note that this 
analysis is totally decoupled from the result in the previous section where all we need to 
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know are the observed /|, not their estimates for our sky. 

In Fig. 1^ we plot in dotted hnes P{lf\Oi) for our data set. We also plot in solid lines 
the cosmic variance distribution of If in skies with the same galactic cut. The vertical 
line is the observed invariant If{Oi). As expected we see that, as i gets larger, the spread 
in P{If\Oi) due to noise becomes more important, at £ = 18 dominating the distribution 
function. On the other hand we clearly have succeeded in making measurements for 
i = 4,6, 8, 12, 14, 16. For them P{lf\Oi) are peaked and clearly different from the cosmic 
variance distribution. The fact that P(/j'g|Oj) does not peak at /j'g(Oj) is merely a failure of 
the prior. The measurement of /fg(Oj) is therefore a signal and is not dominated by noise. 
We have further checked that the signal to noise in power at £ = 16 is of order 1. 

Next we wish to know if galactic emissions could be blamed for this result. We can 
proceed in three ways. Firstly we may use instead the DMR cosmic emission maps, where 
a linear combination of the various DMR channels is used to separate out the foreground 
Galactic contamination. In these maps the noise level is considerably higher. Plotting 
the counterpart of Fig. ^ for this case we find that the distributions of the actual If for 
our sky, given noise induced errors, are very similar to their cosmic variance distributions. 
The measurement is therefore dominated by noise and inconclusive. We find Xqq^^ = .4, 
consistent with Gaussianity, but this is a mere check of the Gaussianity of noise. Hence this 
approach towards foregrounds turns into a dead end, but serves to show how large angle 
Gaussian tests is a field constrained by noise, not cosmic variance. 

As an alternative approach we may subject galactic templates to the same analysis. 
At the observing frequencies the obvious contaminant should be foreground dust emission. 
The DIRBE maps ( |Boggess et al. 1992| ) supply us with a useful template on which we can 
measure the Ifs. We have done this for two of the lowest frequency maps, the 100 fim and 
the 240 /xm maps. The estimate is performed in exactly the same way as for the DMR data 
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(i.e. using the extended Galaxy cut). We performed a similar exercise with the Haslam 
408Mhz ([Haslam (1982)| ) map. We display their values in Fig. ^. As expected the two 
maps have consistent values for the If. However they do not have a non-Gaussian value at 
£ = 16. Indeed for all i the If are within Gaussian cosmic variance error bars. This is not 
surprising. DIRBE maps exhibit structures on very small scales. These should average into 
a Gaussian field when subject to a 7° beam. 

As a third alternative we may use foreground corrected maps. In these one corrects 
the coadded 53 and 90 Ghz maps for the DIRBE correlated emission. We have considered 
corrected maps in ecliptic and galactic frames, and also another map made in the ecliptic 
frame but with the DIRBE correction forced to have the same coupling as determined in 
the galactic frame. As shown in Fig. ^, in all of these the non-Gaussian signal at £ = 16 is 
enhanced, although we observe large variations in /| at £ = 4 — 8 (a phenomenon noticed 
before when estimating Cg-s). In fact the corrected maps exclude Gaussianity at the 
confidence level of 99.5%. 

It would be interesting to relate our result to the curious dip in power at £ ~ 16 
provided by the maximum likelihood estimates in porski (1997)| . These show that, assuming 
a Gaussian signal, the power in signal and noise is unusually low at £ ~ 16. One wonders 
how this would be affected if non-Gaussian degrees of freedom were allowed into the 
estimation ( [Ferreira, Gorski fc Magueijo 1998|) . 

We have also subjected our work to a variety of numerical tests. Arbitrary rotations 
of the coordinate system affects results to less than a part in 10^. More importantly, 
comparing data pixelized in the ecliptic and galactic frames, we found that our results were 
very robust, indeed more so than the power spectrum estimation (see the bottom pannel of 
Fig. 1^). We also tried different galactic cuts, and found that although the non-Gaussian 
signal gets transferred to other i, one does not fully erase it until a cut of ±40° is applied. 
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Finally we checked the effect of varying the offset in the cut map. We found that for 
any other prescription than the one used the effect is enhanced, often leading to rejecting 
Gaussianity at more than the 99.5% confidence level. 

To conclude, we have not been able to attribute our result to a known contaminating 
source or a systematic. Indeed the confidence level quoted refers to the worst result obtained 
within the set of effects explored. Of course it is always possible that this non-Gaussian 
signal comes from some yet unmapped foreground, which cannot be separated from the 
CMB anisotropy signal in the COBE-DMR data — the poorly known free-free emission 
from the Galaxy comes to mind here. 

If indeed our results arc due to a foreground contamination one should note the 
following two points. First, we would have demonstrated that DMR data is more 
contaminated by foregrounds than thought before. Second, Galactic emissions on the scales 
considered are often assumed to be Gaussian. In fact this assumption is used in subtraction 
algorithms based on the idea of optimal filtering. The discovery of a distinctly non-Gaussian 
galactic emission would in the very least require a rethinking of the foreground subtraction 
algorithms. 

If, on the other hand, the CMB signal itself is demonstrably non-Gaussian, we would 
not need to over-emphasise the epistemological implications of our findings. 
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Fig. 1. — The vertical thick dashed hne represents the value of the observed If. The sohd 
line is the probability distribution function of If for a Gaussian sky with extended galactic 
cut and DMR noise. 
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Fig. 2. — The vertical thick dashed hne represents the value of the observed /|, that is 
If{Oi). The dotted line is the distribution of where the If for our sky are likely to be, 
given the observations Oi and noise, that is P{If\Oi). The sohd line is the cosmic variance 
distribution of If for a Gaussian process (subject to the same cut, but with no noise). 
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Fig. 3. — In the bottom panel we plot the measured le from the data in ecliptic coordinates 
(open circles) and galactic coordinates (sohd circles); the center panel has the foreground 
corrected data in ecliptic coordinates (open circles) and galactic coordinates (solid circles) 
and ecliptic coordinates with galactic frame coupling (stars); the top panel has the DIRBE 
lOOyum (open circles ) DIRBE 240y[/m (solid circles ) and Haslam (stars ) data. 



